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(54) High speed serial link for fully duplexed data communication 

© A system for converting between parallel data and serial data is described. In the system, individual 
bits of the parallel data are latched into individual registers. Each register is coupled to a corresponding 
AND gate which is also connected to receive phased clock signals. The output terminals of the AND 
gates are connected to an OR gate. Using the system, with appropriately phased docks, the parallel 
data is converted into serial data. 
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bAuKliKUUND OF THE INVENTION J 

t 

This invention relates to a system for providing a high speed serial communications link allowing fully du- 
plexed serial data communication. In particular, this invention relates to an interface circuit for interconnecting 
5 devices with parallel datapaths via such a serial link. 

As electronic and computer technology continues to evolve, communication of information among different 
devices, either situated near by or at a distance becomes increasingly important. For example, it is now more 
desirable than ever to provide for high speed communications among different chips on a circuit board, different 
circuit boards in a system, and different systems with each other. It Is also increasingly desirable to provide 
w such communications at very high speeds, especially in view of the large amount of data required for data com- 
munications in intensive data consuming systems using graphical or video information, multiple input-output 
channels, local area networks, and the like. • ' 

It is particularly desirable to enable individual personal computers, workstations, or other computing de- 
vices, within which data is normally internally transferred using parallel data buses, to communicate with each 
is other over relatively simple transmission lines. Such transmission lines typically include only one or two con- 
ductors, in contrast with the 64-bit and wider data paths within computing systems now commonly available. 

There have been a number of commercially available products which attempt to provide high speed con- 
version of parallel data to serial form and transmission over a serial link. The Hewlett-Packard G-link chip set 
is one such product. That chip set includes a transmitter set and is capable of handling 20- or 24-bit wide parallel 
20 data. To obtain the necessary speed, however, the chip set is fabricated using a bipolar process, and the re- 
ceiver and transmitter require separate chips. Such a solution is highly power consumptive and expensive. It 
also employs a conventional approach to parallel-to-serial data conversion, that is, the use of a phase locked 
loop oscillator operating at the transmission rate. Such devices typically introduce noise into the silicon sub- 
strate and interfere with other phase-locked loop circuitry on the chip. This makes it difficult to integrate many 
25 channels on a single chip. 

Another commercial solution has been provided by Bull of France. The Bull technology employs a frequen- 
cy multiplier for parallel to serial data conversion. Such devices typically introduce noise into the silicon sub- 
strate and interfere with other multipliers on the chip. In addition, the Bull technology uses an exclusive OR 
tree for parallel to serial conversion. The use of exclusive OR trees is well known, together with the difficulty 
30 of equalizing the delay through all paths of such devices. The Bull technology employs a delayed lock loop 
circuit that mandates the use of a special coding scheme which couid result in reduced coding efficiency. 

SUMMARY OF THE INVENTION 

35 This invention provides a very high speed data serializer capable of converting parallel data to serial data 

at speeds greater than one gigabit per secondhand a data/clock recovery circuit that does not require an indi- 
vidual independent clock for each channel. Despite enabling data conversion at extremely high rates, the in- 
vention can be fabricated relatively inexpensively using well known complementary MOS technology, as op- 
posed to gallium arsenide, bipolar, and other technologies conventionally employed to fabricate such high 

40 speed devices. Also, the invention provides a technique for converting serial data to parallel data in which only 
a single oscillator is required for many channels, thereby eliminating the possibility of injection effects which 
make all the oscillators attached to individual channels falsely synchronize to a single frequency in prior art 
devices. In addition, the technique employed avoids the use of exclusive OR trees, and their accompanying 
difficulty of equalizing delay paths. In the preferred embodiment, the parallel-to-serial conversion is achieved 

45 using one or more latches to latch data in the parallel data stream before conversion. The output terminal of 
each latch is connected to a corresponding AND gate. Other terminals of each AND gate are connected to re- 
ceive phased clock signals. When the dock signals are enabled, the latched data is supplied at the output ter- 
minal of the AND gate, which is in turn coupled to an input terminal of multiple input terminal OR gate. As the 
clock signals are phased, data from each sequential AND gate is supplied to the OR gate, and then serially 

50 from the OR gate to a suitable transmitter or other apparatus. 

In one embodiment, a circuit for converting N bits of parallel data into a serial data string includes a register 
having at least N storage locations for temporarily storing the parallel data. A source of an appropriate number 
of clock signals, with each clock signal having a different phase than every other dock signal is provided, and 
a different dock signal is connected to each one of a series of AND gates. Each of the AND gates is also con- 

55 nected to receive an appropriate one of the bits of parallel data. An OR gate with the corresponding number 
of input nodes is then coupled to the output of each AND gate. By appropriate phasing of the dock signals, 
the data presented at the input terminals of the AND gates is converted to serial form by the OR gate and sup- 
plied to a transmitter or other apparatus. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 is a block diagram illustrating at a high level the serial link interface system; 
Figure 2 is a more detailed block diagram illustrating the serial link interface; 
5 Figure 2a is a more detailed diagram illustrating the arrangement of clock conductors within Figure 2; 

Figure 3 is a block diagram illustrating the clock generator 
Figure 4 is a detailed circuit schematic illustrating the clock generator; 

Figure 5 is a timing diagram illustrating the relationship of the phased clocks produced by the circuitry of 
Figures 3 or 4; .„•.••<:•,-.• .*.* 

10 Figure 6 is a block diagram illustrating the data serializer, 

Figure 6a illustrates the detailed structure of the 20-tc-40 multiplexer; 
Figure 6b illustrates the timing diagram of multiplexer operation; 
Figure 7 is a detailed circuit schematic of.the data serializer; 

Figure 8 is a detailed circuit diagram of a single-ended embodiment of the bidirectional buffer; 
is Figure 9 is a detailed circuit diagram illustrating a differential embodiment of the bidirectional buffer, the 

impedance matching circuit and current bias generator; 
Figure 10 is a block diagram illustrating the receiver; 

Figure 11 is a diagram illustrating a hypothetical waveform on the serial link and the manner in which it is 
sampled; 

20 Figure 1 2 is a detailed circuit schematic of a sense amplifier used to sense the state of the serial link; 

Figures 13a and 13b illustrate the clock selection process based on the relative speeds of the transmitter 
and receiver; 

Figure 14 is a detailed circuit schematic illustrating the phase pointer; • 

Figure 1 5 is a block diagram illustrating the connection between the word and phase pointers; - 
25 Figure 16 is a timing diagram illustrating the reset signal; 

Figure 1 7 is a block diagram illustrating the clock multiplexer; 

Figure 18 is a circuit diagram illustrating the data shuffler; 

Figure 19 is a diagram illustrating the data selector; 

Figure 20 is a circuit schematic of the voter; - 
30 Figure 21 is a circuit schematic of the tracking filter; 

Figure 22 presents the overall transmitter timing; and 

Figure 23 presents the overall receiver timing. 

DESCRIPTION OF THE SPECIFIC EMBODIMENTS 

35. 

Figure 1 is a high level block diagram of a system which provides a serial link for fully duplexed high speed 
data communication. As shown in Figure 1, the system 10 is coupled to a transmission line 100. Generally, 
the system includes a parallel interface 12 which is coupled to desired other apparatus. For example, interface 
12 may be coupled to the bus of a workstation, a personal computer, high definition television, a local area 

40 network, a printed circuit board or the like. In essence, interface 12 may be coupled to any outside apparatus 
which provides data in parallel over a series of lines or "bus," Once data is received at the interface, it is then 
supplied to a parallel-to-serial converter 15. Converter 15 converts the. parallel bit stream from interface 12 
into a serial bit stream and supplies it differentially to a driver circuit 20 capable of driving the signals on the 
transmission line 1 00. From driver circuit 20 the signals are transferred to a two-way buffer 25 which is coupled 

45 directly to the transmission line 100. The two-way buffer 25 enables signals to be supplied to the transmission 
line and received from the transmission line simultaneously. The path from converter 15 through driver 20 and 
buffer 25 is provided for outgoing data, that is, data which is to be placed on the transmission tine and sent to 
a remote chip, PC board, system, or other apparatus. Transmission line 100 is a twinax or twisted pair and 
typically functions as a point-to-point link. 

50 Incoming data on the transmission line, which can be present simultaneously with .the outgoing data, is 

, extracted by the two-way buffer 25 from the combined incoming/outgoing signal on the media 100. The in- 
coming data is then supplied through driver 30 to serial-to-parallel converter 34. Converter 34 accepts the serial 
data, supplied in differential form on lines 32. samples it and converts it to parallel data. After conversion the 
data is supplied to a data separator 40 which extracts the appropriate bits from the oversampled data bits and 

55 places them on an output interface 42. In a manner similar to input interface 12, output interface 42 is connected 
to an external computer system, or other desired apparatus to which the data is to be supplied. 

. Figure 2 is a more detailed block diagram of the serial link with four channels. Figure 2 illustrates in greater 
detail the interrelationship between the components of the system 10,- together with the manner in which a 
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group of serial link systems can be commonly controlled to interface with a group of serial links. Also shown 
is the manner in which a common clock signal and common impedance matching circuit may be shared among 
several channels of serial links. 

Each channel of the serial link, for example channel 0, includes a parallel interface 12 from an external 
5 system 50, together with an interface 42 for providing data to the external system 50. In operation, the data 
on interface 12 is supplied to a transmitter, the detailed structure of which is described below. A bidirectional 
buffer 25 receives the serial data from the transmitter and places it on the serial link transmission line 100. 
Similarly, serial data from the transmission line 100 is received by the bidirectional buffer 25, sampled by a 
sampler 34, and then supplied to a receiver 52 for conversion to parallel data. A parallel bus is provided to the 

w external system 50 from the receiver. ■' • 

A high speed clock 60, described in more detail belowj is coupled both to the serial link 100 and to a series 
of conductors 62 for distributing the clock signals among'the transmitters. The receivers are also coupled to 
clock line 62. The high speed clock is an important part of the system. In conventional data serializers, D-type 
flip-flops or latches are configured as parallel-in serial-out shift registers. Unfortunately, when using such an 

15 approach, it is difficult to manufacture a very high speed data serializer because of the difficulty of generating 
and handling dock signals whose frequency equals the data rate. It is also difficult to design the flip-flops or 
latches to operate at the speed of such a dock. Thus, in a serializer with a bit rate exceeding 1 gigabit per sec- 
ond, it would be very difficult to generate a dock with that high a frequency, as well as latches, flip-flops, etc. 
The difficulty is even more substantial when it is desired not to employ high power consuming or futuristic sem- 

20 iconductor technologies. Bipolar technology, for example, is capable of switching at such speeds, but generally 
consumes a large amount of power, necessitating high capability powersources.chipcooling, and similar prob- 
lems. Gallium arsenide is another technology which can provide such high speed docks; however, components 
manufactured using gallium arsenide technology are generally expensive and not widely available: In contrast, 
CMOS technology is widely available, relatively inexpensive, and generally does not consume much standby 

25 power. 

To overcome these disadvantages, the dock 60 of our system, along with the remainder of the system, 
preferably is fabricated using standard CMOS technology, yet enables the provision of a sufficient number of 
clock phases to adequately handle data rates on the order of greater than 1 gigabit per second. 

Figure 2a is a diagram illustrating the layout of the clock lines shown in Figure 2. When multiple clocks 

30 are distributed across a circuit board or other substrate, cross-talk from neighboring wires impacts the signal 
delay on each clock line. Figure 2a illustrates a series of parallel dock lines, each capacitively coupled to an 
adjoining dock line. If the clocks are multiphase docks with equally spaced phases, the dock signals in the 
middle of the set of lines will be affected in the same way as the signals on adjoining dock lines, thereby nul- 
lifying the adverse effect of the capacitive coupling. The only exception to the nullification are the dock lines 

35 near the edges of the parallel conductors. Because those clock lines have one of their neighbors missing, there 
is an asymmetrical effect, thereby causing dock skew. As depicted in Figure 2a, however, in the preferred em- 
bodiment of this system, dummy dock wires are distributed to make dock lines on each edge of the parallel 
group receive symmetrical coupling from the dummy wires, thereby removing any skew. Thus, dock phases 
<|>o and fa are duplicated andplaced adjacent to'the lines carrying dock phase Similarly, lines carrying <(>„_ 

«> , and ij> n are placed adjacent the line carrying dock phase fa\ In this manner, the asymmetrical capacitive cou- 
pling is nullified. In an embodiment in which every dock line is required rather than every fourth line, the layout 

is modified accordingly so that fa, fa, fa fa.,, fa are adjacent. In another embodiment, dock lines can be 

placed in such a way that neighboring docks have settled into static 0 or 1 , when the one in the middle makes 
a transition. For example, with 30 phase docks, fa must have its neighbors be fa) fa, fa s or fa*, but not with 

45 fa 9 or fa. Of course, numerous lines are needed at the far sides of the bus. < 

Figure 3 is a block diagram of clock circuit 60 used to generate the phase docks supplied to the serial link 
system, whether employed singly or in a multiple channel embodiment such as depicted in Figure 2. As shown 
in Figure 3, the clock system indudes a phase frequencydetector 64, a charge pump and loop filter 65, and 
a series of delay cells 68. The circuit depicted functions to receive a reference dock signal on line 70, and in 

50 response to transitions in that dock signal generate a desired number of internal docks on output lines 73 due 
to the effect of a frequency divider. Each of the docks corresponds to a phase shifted dock signal. In the em- 
bodiment shown, the 30 equal phase shifted clock signals are generated with twice the reference frequency. 
In other words, between two sequential rising edges there are 29 rising edges of an internal clock. With a 50 
MHz external reference clock, the system effectively can supply a much higher speed clock, for example, in 
55 the embodiment, the number of edges possible with a single clock frequency of the 30 docks is 3 GHz. These 
multiple phase docks can then be used to control the transmitter and data sampling functions of the serial 
link. For the particular embodiment depicted, 10 phase docks (every third one of the thirty) are supplied to 
the transmitters and 30 phase docks are supplied to the data sampler. The 30 outputs from the data sampler 
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can then be examined in the receiver to recover the correct data and original reference clock. Using the em- 
bodiment depicted in Figure 2, the phase docks also can be shared among various channels of serial link sys- 
tems. 

The embodiment depicted in Figure 3 functions as a phase lock loop. The phase detector 64 compares 
5 the frequency of one of the subphase clocks ckO 73, after division by two, from one delay cell with the frequency 
of the reference dock signal on line 70. The output of the phase detector, after passing through loop filter 65, 
is fed back through the delay cells to keep the delay precisely in phase with the reference dock signal. In this 
manner, the subfrequency dock-signals on Iines 73 are equal divisions of the original reference dock. 

Phase frequency detector 64 provides an output signal which is related to the phase difference between 
io the reference dock signal and the oscillator clock signal from a stage of the delay cells. This enables the os- 
cillator clock to be controlled in synchronism with the reference dock signal. The output of the phase frequency 
detector 64 controls the charge pump and loop filter 65, resulting in a control voltage being supplied on line 
74 to the delay cells 73. The control voltage speeds or siows the delay cells. Each of the delay cells provides 
an output signal which is delayed slightly from the output signal of a preceding delay cell. By virtue of the phase 
is lock loop, each of the outputs of the delay cells will be an integral division of the frequency of the reference 
clock signal. The frequency f of the oscillator is controlled by the control voltage with the relationship f = 

— - — where N is the number of delay stages and Td is the delay time of each delay cell as a function of the 
2-N-Td 

control voltage. 

20 The dock generator includes a voltage controlled oscillator (VCO) constructed with 15- stage differential 

delay cells. a frequency divider, a phase frequency detector, a charge pump, a loop filter, and dock buffers. 
Because each phase dock is buffered by two parallel buffers, two groups of 100 MHz, 30-phase clocks are 
generated from the dock, buffers. Separating the equally loaded and equally spaced 'dean' docks from 
the 'dirty' docks is the reason each phase dock is buffered by two dock buffers and produces two clocks that . 

2s have equal phases for the two groups of 30-phase clocks to be generated. One set is used as 3 GHz data 
oversampling docks in the receiver. In data oversampling it is important that the sensing interval be as equally 
spaced as possible, that is, the sensing interval is to be controlled to within 0.33 ns. This set of docks must 
be equally spaced and equally loaded, that is, 'clean' clocks. The other set of docks is used for the transmitter 
clocking, with one dock being used by the data-loader contained by the transmitter for loading 10 bits input 

30 parallel data. The docks 73 are 100 MHz 10-phase docks. The interval of each phase is equal to the transk 
mission time of one bit serial datum. 

Figure 4 is a circuit schematic illustrating the circuit structure of the dock generator. In the lower right- 
hand portion of Figure 4, two delay cells 86 and 87 are shown. Each of delay cells 86 and 87 correspond to 
one delay cell in the 30 stages of delay cells 68 shown in Figure 3. At nodes 88 and 89, delay cell 86 receives 

35 the reference dock sign al and its inverse. The clock signal toggles transistors 91 and 92 on and off generating 
output signals OUT and OUT which are used to control dock driver circuit 90. Clock driver circuit 90 is a con- 
ventional dock driver and uses the differential output signals to generate dock signal ckO. The same output 
signals used for ckO are also coupled to provide input dock signals to transistors 93 and 94 in delay cell 87. 
In the same manner, delay cell 67 provides output signals at an intermediate node which are both supplied to 

40 clock driver 95 to generate dock signal ck1, and passed to a subsequent delay stage (not shown). Because 
of the switching delay in each stage of delay cells, the input clock signal for each subsequent stage of delay 

cells will be slightly delayed from the input dock signal at preceding stages. Thus, docks ckO, ck1, ck2 

ckn will each be delayed from the preceding dock by a small amount 

One of the dock signals, in the example, ckO, however, is also coupled back via line 80 to phase frequency 

45 detector 64 where it is compared with the original reference dock signal arriving on line 70. Depending upon 
the phase relationship of the reference dock and the clock chosen from the delay cell chain, the phase fre- 
quency detedor will supply output control signals to turn on and off transistors 81 and 82. Phase frequency 
detector 64 is of well known design.. The control signal supplied from phase frequency detector 64 will pull the 
node A between transistors 81 and 82 up or down, depending upon the phase relationship of the reference 

so dock and dock ckO. When transistor 81 is on, current is pumped into the node A which increases the control 
voltage applied to transistors 96, 97, etc, commonly coupled to node A. This increased voltage turns on tran- 
sistors 96 and 97 more strongly, thereby shortening the switching time of transistors 91, 92, 93, 94, etc., in 
turn, shortening the clock delay within each stage. On the other hand, if transistor 82 is on, node A is pulled 
down, lowering the control voltage on node A and slowing the operation of each delay cell. 

55 A well known loop filtering circuit is also provided and coupled to node A, as shown. In addition, an external 

current control resistor is coupled to node 84 and used to control the maximum current which can be drawn 
by the phase frequency detector control signals. 

Also depicted is a replica bias drcuit which is coupled to the control nodes of transistors 96, 97, etc., as 
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well as to other transistors in the delay cell chain. This replica bias circuit stabilizes the voltage on the delay 
cells by minimizing the impact of fluctuations in the power supply (VDD) caused by the switching of the delay 
cells. 

Thus, the output of each delay cell is tapped to provide a clock signal which is supplied through a corre- 
sponding clock driver 90 to one of the clock lines shown in Figure 2. In this manner, a series of equally spaced 
clock signals, each of a slightly different phase than the preceding clock signal, is provided for use by the trans- 
mitters and receivers of the serial link. 

As described, the delay cells are implemented with a differential structure that contains a voltage control- 
led resistor (VCR) controlled by a replica biasing circuit The differential structured delay cell is used to support 
a high frequency oscillation. The replica biasing circuit has the same structure as the VCO delay cell. Thus, 
the minimum voltage level of the clock signal swing is realized at the replica biasing circuit, which is compared 
with a reference voltage by an operational amplifier. This bp amp compares the minimum voltage levet of the 
clock signal with the reference voltage to control the VCR included in the replica biasing circuit and 15-stage 
delay cells. As a result, the VCR's are adjusted by the op amp to limit the voltage swing of the VCO signal to 
about 1v, which can support the high frequency oscillation. A compensation MOS capacitor is included for the 
stability of this feedback loop. The current level of the charge pump is controlled by a current mirror circuit 
The current level of the current mirror is controlled by a resistor connected to VDD via a external pin 84. The 
size of the current mirror is about ten times the size of the current source of the charge pump. This removes 
the effect of a leakage current and increases the precision of the current level of the charge pump. 

Figure 5 is a timing diagram illustrating the phase relationship of a series of clock signals from the system 
of Figure 3. As described, each clock signal is slightly delayed from the preceding clock signal: 

Figure 6 is a block diagram illustrating the parallel-to-serial converter portion 15 (Figure 1) of the trans- 
mitter circuit in the serial link. As shown in Figure 6, the parallel-to-serial converter 15 includes a pair of registers 
117 and 118. Register 117, preferably a series of D-type flip-flops, receives as input the data from each line 
of the parallel data bus, and latches such data into register 11 7 upon receipt of a clock signal, designated clock 
1 in Figure 6. 

Register 118 is coupled to receive the data from some of the flip-flops of register 117, and latches that 
data in upon receipt of a clock signal clock 2. The extra register 1 1 8 provides additional set up and hold time 
and is therefore optional. With register 118, new data can be loaded into register 117 while data from the pre- 
vious cycle is still being transferred from register 118. After the data has been clocked into the registers, it is 
then clocked into an AND/OR circuit which consists of a series of AND gates 110 and an OR gate 115. 

Because in the preferred embodiment eight bits of parallel data are encoded as ten bits of serial data, ten 
data bits are shown in the embodiment depicted in Figure 6. Although any desired encoding scheme may be 
employed, this coding scheme is used in the preferred embodiment to assure that, regardless of the state of 
the data bits, a sufficient number of edges is always presented to enable a receiver on the serial data link to 
sy nchronize wit h the incoming data. Without the assurance of "edges" in the incoming data, the receiver can 
drift when long strings of zeros or ones are being received. Additionally, the two extra bits allow for balancing 
of the data stream to assure that iong streams of zeros or ones do not cause the receiver bias to drift 

The data from the registers is presented to AND gates 110 with each bit on a separate conductor 116. Each 
conductor is coupled to a corresponding one of the AND gates 110. Each AND gate also receives one of the 
subf requency clocks generated by the clock generator circuit depicted in Figure 3. Preferably, the subf requen- 
cy clocks are equally spaced. In other words, if 1 0 clocks are employed for serialization, then every third clock 
signal of the 30 possible clocks is chosen. Considering one example, AND gate 110d receives data bit 2 at one 
input terminal and phase clock fa at a second input terminal. Were it not for the third input terminal of the AND 
gate, the presence of the clock signal would enable the AND gate to pass the data on from the register 116 
through the AND gate to the output terminal of the AND gate and then onto one of the input terminals of the 
OR gate 115. 

Because the clock signals fofa are overlapping, it is necessary to prevent more than one input terminal 
of the OR gate from being active at any given time. This is achieved by providing a third terminal on each AND 
gate connected to receive the inverted clock signal of the phase next in order to the phase directly coupled to 
that AND gate. For example, AND gate 110d receives phase fa and through an inversion terminal is coupled 
to also receive phase fa. In this way, whenever clock signal fa is active or inactive, and clock signal fa is not 
active, the AND gate is enabled and the data supplied to it is passed to the OR gate 115. In contrast whenever 
both clock phases fa and fa are active, the AND gate is disabled, and no data is passed to the OR gate 115, 

The OR gate 115 receives as input signals the output signals from each of the AND gates 110. Because 
the input signals arrive at the OR gate sequentially, the output of the OR gate 115 provides a serial data stream 
representative of sequential sampling of the parallel input data supplied to the register 117. By appropriately 
phasing clocks 1 and 2 supplied to registers 117 and 118. these registers can be reloaded with data so that 
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upon the rearrival of dock phase ifo, a second set of data is ready to be clocked through the OR gate 115 without 
need to wait for reloading the registers. 

Figure 6a and Figure 6b show the detailed structure of the 20- to- 10 multiplexer and the timing diagram 
of multiplexer operation. The multiplexer is composed of 10 NMOS pass transistors and 10 PMOS transistors. 

s The lower 10 bits of the 20 bits input data are connected to the 10 NMOS pass transistors and the upper 10 
bits are connected to the 10 PMOS pass transistors. A 50 MHz mux control clock named 'mux_switch' is ob- 
tained by dividing "M_clk(2)' by two. As shown in Figure 6b the 'mux_switch' dock is toggled according to the 
falling edge of the 'M_dk{2)'. While the 'mux_switcrr dock is held high the 10.NMOS pass-transistors are 
turned on and the lower 10 bits are transmitted to the transmitter via the mux and sensed by a data-loader at 

10 the rising edge of "M_dk(2)'. While the dock is held low, the 10 PMOS pass-transistors are turned on and the 
upper 10 bits are transmitted to the transmitter.via the mux and sensed at the rising edge of the 'M_dk(2)\ 

Figure 7 is a circuit schematic illustrating the circuitry used in the AND and OR gates 1 10 and 115 in Figure 
6. In Figure 7 each branch 120 provides an AND function. For example, for the left-hand branch of Figure 7, 
the node 125 will be controlled by the state of d5 only if dock ckO is present, and dock ck1 is not active. Thus, 

15 when the proper dock signals are active, the output node receives the data from node d5. The branches rep- 
resentative of the AND gates are alt coupled together in common at node 125, thereby representing the OR 
gate. In other words, as the dock signals are phased, the data presented at the control nodes of the MOS tran- 
sistors is placed on the output node 125. Thus, the output node first carries data d5, then d6, then d3, etc 
In a preferred embodiment of the invention, a differential data signal is used. In this case, the right-hand 

20 portion of Figure. 7 is also utilized. The right-hand portion of Figure 7 corresponds to the left-hand portion of 
Figure 7, except that the data is inverted, or is complementary, to the data on the left-hand side. Thus, the 
left-hand branch of the right-hand portion of Figure 7 receives an inverted data signal d5, and all of the branches 
also receive inverted data signals. In this manner, complementary data is presented at output node 128 from 
that at output node 125. An enable signal is coupled to the control electrodes of transistors 130 and 131 to 

29 switch the data from nodes 125 and 128 out onto line 135 to ultimately drive the serial link. 

Figure 8 is a circuit schematic of an output driver and bidirectional buffer. The circuit schematic of Figure 
8 preferably is employed as the bidirectional buffer 25 of Figure 1 or Figure 2. The circuit shown is a single-, 
ended circuit. A differential bidirectional buffer is shown in Figure 9 and described below. 

• When a transmission line is used as a media for communication, simultaneous use of the transmission 

30 line in both directions is permissible only if a proper circuit is employed to separate incoming data from the. 
combined incoming and outgoing data appearing on the transmission line. In this manner, fully duplexed op- 
eration is permitted. Of course, fully duplexed operation is advantageous because it doubles the available 
bandwidth without incurring any extra hardware costs for the media. 

The circuit shown in Figure 8 functions as a subtracter which subtracts the outgoing data from the mixed 

35 . data present on the transmission line, in effect thereby extracting the incoming data. Implementing subtracter, 
circuits in CMOS is typically difficult because exact resistors are usually required for arithmetic operations, 
and such exact resistances are not available in CMOS technology. 

The circuit shown in Figure 8 is particularly advantageous because it does not require the use of resistors. 
Instead, it uses a combination of a replica-sending signal generator and differential pairs. The replica-sending 

40 signal generator makes available an exact replica of the sending signal outgoing from the transmitter. By then . 
comparing the received signal with the outgoing signal, the net incoming signal can be determined. For in- 
stance, if the received signal and the replica signal are the same, the result of the comparison is 0, meaning 
that there is no net received signal. If the incoming signal, however, is greater than the replica signal, the result 
of the comparison is positive, meaning that a positive signal was received. on the transmission line. Similarly, 

45 if the comparison is negative, a negative signal is received on the line. 

In Figure 8, the mixed signal on transmission line 100, consisting of both incoming and outgoing data, is 
supplied to node 140 which is coupled to one electrode of transistor 144. The other electrode of transistor 144 
is coupled to receive a reference voltage at node 146. The reference voltage is generated using a reference 
voltage applied to transistor 145 which is coupled to node 146 and to a current source 147. 

so The control electrode of transistor 144 is coupled to receive the data outgoing from the transmitter in an 

inverted form. This data is also supplied to transistor 149 where it is inverted and supplied at node 150. Thus, 
node 150 represents a replica of the outgoing data signal from the serial link transmitters. The transmission 
line 100 is also coupled to the control electrode of transistor 142 of a differential transistor pair 142 and 152. 
The control electrode of transistor 1 52 is connected to node 150 to receive the replica of the outgoing data. 

55 Another electrode ofeachofthe transistors 142 and 152 is coupled togetherto a current source 155.Theother 
electrode of transistor 142 is coupled to a potential source, while the other electrode of transistor 1 52 Is coupled 
- through a load to a potential source. Thus, the differential pair of transistors 142 and 152 compares a replica 
of the outgoing signal with the combined incoming and outgoing signals present on the transmission line. When 
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both of these signals have the same potential, the output signal on line 157 will be at 0 volts. On the other 
hand, if the transmission line 100 is higher in potential, the output on line 157 will be positive, while if the replica 
of the outgoing signal is higher, the output of the circuit on line 157 will be lower in potential. 

The circuit shown is particularly advantageous because the voltage swing on the output 157 is relatively 
small, on the order of one volt. Thus, the output switches more quickly than conventional subtracting circuits. 
Full duplex data transfer using only a coaxial cable is made possible by the bidirectional bridge and a proper 
impedance matching circuit. 

Figure 9 shows a complementary embodiment of an impedance matching circuit and a bidirectional bridge. 
PMOS transistor 175, included as a voltage controlledresistor (VCR) in the impedance matching circuit, is an 
equivalent 500 ohm resistor. Each VCR 170 and/173 included in the bidirectional bridge is art equivalent 50 
ohm resistor because each is ten times larger than transistor 175 and controlled by the same VCR-control 
signal ZObias. Similarly, transistors 171 and 172 are equivalent' 250 ohm resistors. The transistors 170 and 
173 act as internal termination resistors of the transmission line to remove signal reflections. At the 
nodes 'dsend' and "dsendb", the outgoing differential signal appears, while at the nodes 'twinax_da- 
ta' and 'twinax_datab\ the mixture of the outgoing differential signal and the incoming differential signal ap- 
pears. Because the 'twinax_data' and 'twinax_datab' are the cable ends of the co-axial transmission line 
whose characteristic impedance is 50 ohms, the active load resistance of that node is 25 ohms based upon 
the parallel 50 ohm VCR and the characteristic impedance. At the maximum current steering circumstance, 
the current level of the nodes of 'twinax_data' and twinax_datab' is ten times greater than transistor 1 75 (which 
is an equivalent 500 ohm resistor). Thus, when the current level of the transistor 175 is 2 mA, twinax_data, 
twinax_datab will supply 20 mA. The maximum swing of only outgoing differential signals appearing at the 
nodes (twinax_data, twinax_datab) without incoming differential signals is about 0.5 v (25 ohm x 20 mA = 0.5v). 
If the incoming differential signals are received, the mixed differential signals of incoming and outgoing signals 
appear at the nodes twinax_data and twinax_datab; If the incoming differential signals and the outgoing differentia) 
signals are 'in-phase', the maximum swing of the mixed differential signals is removed (0.5 - 0.5 = Ov), that is, the 
deviation between twinax_data and twinax_datab is zero. The extraction of the incoming differential signals from 
the mixed differential signals is performed by a extractor circuit included in the bidirectional bridge. The mixer-struc- 
tured extractor circuit mixes the differential signals constructed with Vtwinax_data' and Vdsend' and another dif- 
ferential signals constructed with Vdsendb' and Vtwinax_datab'. The formula showing this mixing mechanism 
is: < 

Vrec-Vrecb = A((Vtwinax_data-VoWd)+(Vdse^ 

A((V twinax_data - Vtwinax.datab) -(Vdsend- Vsendb)) 

The impedance matching circuit is constructed with two feedback loops, one for generating a current-bias- 
ing signal named 'lObias', the other for a VCR control signal named 'ZObias'. 

The opampl shown in Figure 9 compares reference voltage Vext_swing with a node voltage Vext_res: That 
node is connected to VDD through a 500 ohm resistor. If there is a deviation between Vext_swing' and Vext_res'. 
the output signal of the opampl controls the current level of the 500 ohm resistor to compensate for the de^ 
viation by adjusting the gate voltage of a current source. The gate voltage of the current source is used as a 
current-biasing signal named 'lObias'. As a result, Vext_res' is equalized to 'Vext_swing' by the negative feed- 
back loop constructed with opampl. 

The opamp2 shown in Figure 9 compares the *Vext_swing' with an internal node voltage'Vnodel' which 
is the drain of a PMOS transistor used as a VCR. If there is a deviation between the Vext_swing' and Vno- 
dei" the output signal of the opamp2 controls the gate voltage of the VCR to remove the deviation. The output 
signal of opamp2 is used as a VCR control signal 'ZObias'. As a result, the VCR is equivalent to a 500 ohm 
resistor because the current level of the VCR is equalized to that of the 500 ohm resistor by the lObias sig- 
nal. 'Vnodei' is equal to the "Vext-res' which are equalized to the 'Vext_swing\ Thus, a PMOS transistor can 
be used as a VCR equivalent to a resistor if the gate voltage of the VCR is controlled by the 70bias" signal 
and the current level is determined by an 'lObiased' current source. For the stability of the feedback loops the 
internal compensation MOS capacitors are included; These feedback loops become active at power-up and 
continually compensate for temperature and power supply variation. 

Figure 10 is a block diagram illustrating in more detail the receiver system. The system shown in Figure 
1 0 converts the serial data arriving on the line designated 'Serial In' to parallel data and supplies it as Rx Data 
to other apparatus {not shown): The data present on the Serial In line corresponds to the data supplied from 
the bidirectional buffer shown in Figure 9. In other words, the data on the Serial In line has already had the 
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outgoing data from the transmitter subtracted from the mixed signal present on the transmission line, and 
therefore represents the data to be converted to parallel form. Although depicted in Figure 10 in single-ended 
form, it should be appreciated that in the preferred embodiment, all of the circuitry shown in Figure 10 is dif- 
ferential. 

5 In Figure 10, the clock 60 provides 30 phases of clock signals to the data sampler. Thus, assuming that 

an encoding technique is used which results in ten bits of data being present, the data sampler will sample 
each bit of data at three different times. The three-fold sampling enables determination of edges in the data 
stream and the alignment of the data hits.<?onrectly. ; ., r . v .- , 

In general, there are two goals forthe structure shown in Figure 1 0. First, the incoming data must be sam- 

w pled In a manner which enables correct, detection of the state of any bit in the incoming serial data stream. 
Because there is no synchronization of the clock signals between the original station from which the data was 
sent and the receiving station at which the data is detected, the system must be able to determine the edges 
of each data bit. In addition, the system must.he able to determine the correct location of the first data bit in 
each frame of data. Otherwise, although the data is correctly detected and sampled, the contents of each byte 

is (word) will be incorrect because that word may include as its higher order bits the lower order bits of a preceding 
byte, and as its lower order bits the higher order bits of the byte to be received. Other errors are also possible. 
Thus, the system shown in Figure 10 functions to detect the state of each data bit, and to align the bits into 
the proper frame boundaries as complete bytes or words. 

The data recovery system of Figure 10 provides low oversampling ratio, low latency, and parallel data re- 

20 covery. It uses triple oversampling based on the assumption that the frequency difference between a remote 
station dock and a local station dock will be small. The low oversampting ratio and paraljel.data processing 
increase the data processing rate. In one clock cycle of a 100 MHz clock, the data sampler provides 30 bits of 
data, sampled corresponding to 1 0 bits of actual data. The digital phase locked loop finds transition edges in 
the data. There can be as many as 10 low-to-high or high-to-low transitions in 30 bits of sampledata. Multiple 

25 transition positions are digitally averaged and low-pass filtered. Digital low pass filtering removes fast phase 
variations caused by jitter, yet follows gradual phase drift caused by frequency differences between the remote 
station and the local station. , 

Figure 11 shows a typical data signal on the transmission line together with the 30 sample points. The 
clock 60 provides 30 phase clocks to the data sampler coupled to the serial link. For illustration, the hypothetical 

30 bitstream of 1001110101 is shown in single-ended form. If a differential transmission line is employed, then a 
mirror image of the waveform shown in Figure 11 will be added, and that mirror image will be sampled with 
the same clock signals, but additional circuitry. Effectively, however, the parallel sampler functions to sample 
the waveform and supply the resulting binary output signal to a data shuffler 210 (Figure 10). 

The Data Sampler in Figure 10 indudes sense amplifiers for sampling the incoming data. Figure 12 illus- 

35 trates in more detail the sense amplifiers which detect the state of the data on the data-in line shown in Figure 
10. As shown in Figure 12, input nodes 211 and 212 are coupled to receive the differential signal from the bi- 
directional buffer. The signal is then passed through an inverting stage formed by inverters 213 and 214 and 
supplied to a pair of serially coupled transistors coupled to appropriate dock phases. In the case of the sense 
amplifier depicted in Figure 12, dock phases and are used. Other sense amplifiers coupled to the serial 

40 link will use other dock phases enabling sampling the data at a different time. The sense amplifier functions 
using a precharge, sample, regenerate and hold sequence. The first stage shown in the upper portion of the 
figure functions primarily for the detection of the signals, and then a second stage, shown in the lower portion 
of the figure, provides main amplification of the detected signals. 

When docks <h and overlap, the input signals are supplied to nodes 215 and 216. The circuitry between 

45 nodes 215 and 216 forms a cross-coupled differential sense amplifier of well known design, and amplifies the . 
signal. 

The transistors generally at 217 receive clock signals and function to equilibrate the nodes 215 and 216 
before sensing. In other words, immediately before clock signals and overlap, nodes 215 and 216 are 
shorted together to equilibrate them. Then the differential voltage from input notes 211 and 212 can be more 

so accurately sensed. The transistors at 21 8 connect the first stage of the sense amplifier with the second stage. 
When these transistors are on, the output signal from the first stage of the sense amplifier is transferred to 
the input node of the second stage of the sense amplifier. The second stage is similar to the first stage, and 
is logically equivalent to an inverting transparent latch. 

As shown in Figure 10, a phase detector 240 is coupled to the data shuffler and functions to detect edges 

55 in the data. For example, the phase detector can detect the presence of an edge between dock signal ck2- 
and ck3 (see Figure 1 1), or between ck20 and ck21 or any two dock signals. Once the reliable location of an 
edge is known, the individual data bits can be determined. In other words, once the edge between ck2 and 
ck3 is known, then it is known that the samples taken at dock signals ckO, ck1. and ck2 represent one bit in 
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the data stream, while another bit in the data stream is represented by the samples taken at clock times ck3, 
ck4, and ck5, etc. The data sampler provides this information to the voter to enable it to determine which data 
samples should be used to represent bits. For example, using the previous hypothesis, the data sampled at 

clock signal ck1, ck4 ck19 will represent a correct sample of the state of the bit on the data line. (These 

5 are the samples taken near the middle of the pulse forming that bit.) 

Of course, it is not sufficient to merely detect the bits. As explained above, for the data to be reliable, the 
frame boundaries around each byte or word of data must be determined. The data shuffler 210, effectively a 
barrel-shifter, performs this function. This technique is explained below. 

At the time the system is first turned on. data is encoded using the desired protocol. In the preferred em- 

10 bodiment, as mentioned above, an 8-bit to 1 0-bit protocol is used to provide sufficient edge regardless of the 
content of the data, and to provide statistical balance to the data to prevent biasing of the receiver. Because 
8 bits are encoded into 10, certain bit combinations represent illegal conditions. These conditions may be de- 
tected and used to provide frame control synchronization.'etc. When the serial link interfaces are initially ac- 
tivated, under software control, each station of the interface sends a known pattern to the other station of the 

is interface. The known pattern is sent repeatedly until the transmitter, at one end of the serial link, and the re- 
ceiver, at the other end of the serial link, synchronize with each other. Once the two have become synchronized, 
an edge pointer is determined by circuit 250. The edge pointer effectively "points" to the clock phase repre- 
senting the first bit of the data in each byte. Thus; for the sample data stream shown in Figure 11, the edge 
pointer might point to the bit samples at ck20 in the data stream as being the first bit of data in the sequence 

20 of 10 bits. In this manner, the system "knows" that the state of the data bits sampled during signal ck20 rep- 
resents the state of the first data bit in the frame. (The data separator will later remove the extra 2 bits of data 
from the 10 bits, treating them as "valid" bits and supplying the remaining 8 bits as output.) 

Once the frame boundaries are correctly determined, the voter 220, essentially a 3:1 multiplexer, desig- 
nates the correct 10 bits of output, properly arranged, to the data selector 230. As shown in Figure 10, the 

25 system can then supply the differential data output signals. 

Data recovery is done by passing t bit from the oversampled 3 bits. Only the bits which sample the centers 
of incoming data are passed. Ten bits are processed in one internal clock cycle. Because other digital circuits 
outside the digital phase lock loop (DPLL) operate at half the frequency of the DPLL, the DPLL delivers 20 
bits of parallel data in 20 ns. 

30 A 30 to 1 multiplexer selects one clock among 30 equally spaced clocks by two pointers. The selected clock 

is used for docking the DPLL itself after buffering. The selected dock is divided by two before it is passed to 
other digital circuits. This divided dock has the same frequency as that of remote station. 

The DPLL also does most significant bit (MSB) alignment The coder outside the DPLL checks the MSB 
position in the 20 bit parallel data during preambling period. When the recovered data are not word aligned, 
35 the coder asserts the SKIP signal. The DPLL rotates the bit positions by 1 bit when it sees the SKIP signal 
asserted. The controller asserts the SKIP signal until recovered data are word aligned. 

There are two pointers which holds current phase and word information. The phase pointer is a 3 bit ring 
counter, and it stores the phase information. The phase pointer, P, can take 3 values 0, 1, 2. Only 10 bits out 
of oversampied 30 bit parallel data, OverData[0:29], are selected according to the phase pointer. When P = 
<o "O", OverData[1]. OverData[4]. .... OverData[28] are selected as recovered data. The relation between the 
phase pointer and recovered data, RecData is: 

RecData[n] = OverData[3n + (P + 1)mod3] 
n=0,1,2,...,9 ' 
If the current phase pointer is 1, it means that the low-to-high data transitions in the oversampled data 
45 occur between OverData[3n] and OverData[3n+1 ]. For each cyde, low-to-high or high-to-low data transition 
positions are counted and compared with current phase pointer. The number of transitions occurred between 
OverData|3n] and OverData[3n+1] is encoded into a 4 bit number TranOL Tran01, Tran12 and Tran20 are given 
by: 

50 . 

TRAN01 = £ (OverData[3n]0OverData[3n+l]) 
n=0 

55 ' 
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9 

TRAN12 = £ (OverData[3n^]®OverData[3n+2]) 
u=0 

5 

. 8 . .. . . 

TRAN20 - ^.(OverData[3n+2]®OverData[3(n+l)]) 

The digital edge voter does the summing and comparing of the number of 1's. TranOI, Tran12, Tran20 are 
compared with current phase pointer to, decide whether to move the phase pointer. The phase which has the 
is maximum number of transition is compared with current phase. If the new phase leads the current phase, the 
UP signal is generated. The UP signal is low-pass-filtered. Three successive UP signal updates the phase poin- 
ter by 1 point If the shape of the incoming data is square and the sampling process is perfect, every 1 bit will 
be sampled 3 times and transitions will occur at the points where the phase pointer indicates. However, in a 
real system environment, there are several error sources which result in the data pattern not having three suc- 
20 cessive 0's or Vs. For this reason, the low pass filtering is necessary. 

The word pointer 240 is used for 20 bit alignment and clock selection. It is a 10 bit ring counter and can 
have 10 values.,0, 1, 2 9. There are two cases when the word pointer changes. First, when the DPLL re- 
ceives the SKIP signal, the word pointer is increased by 1. Second, when the phase pointer changes from 1, 
to 2, the word pointer is increased by 1. When the phase pointer changes from 2 to 1, the word pointer is de- , 
25 creased by 1 . The word pointer and phase pointer are reset to 0 when the HP coder sends the RESET signal. A 
Clock recovery is done by selecting one of the 30 multiphase clocks. The selection is done by the word „ 
pointer and the phase pointer. The selected clock is buffered and used by the DPLL itself. Because the coder 
and other digital circuits operate at 50 MHz, the. clock is divided and buffered. The relationship between the 
pointers and selected clock can be represented by the following equation. 
30 INT_CLK = CKI3{(W + 1)mod10) + (P + 1)mod3] 

The local station tracks the frequency and phase of the remote station by changing the word and phase 
pointers. When the frequency of the remote station is higher than that of the local station, the clock which leads 
the current clock is selected. The word pointer change by SKIP signal also changes the clock selection. Figure 
13a shows the clock selection process-when the frequency of the transmitting station is higher than that of 
35 the receiving station. Figure 13b shows the process when the receiving station is faster, 

The data sampler (Figure 10) oversamples the incoming data with the 30 multiphase clocks. SeqOverDa- 
ta[0:29] are oversampled data. Because the sampling happens sequentially, the outputs of the sampler are 
also available in sequential manner, synchronized to the 30 phase clocks. The data shuffler does the bit align- 
ment. The INT_CLK synchronizer comprises 1 5 flip-flops. It synchronizes the 30 bit SeqOverData to the inter- 
w nal clock of the DPLL The edge voter counts and compares the number of : transitions. 

Figure 14 shows the phase pointer. It is a three-bit ring counter, which circulates a single bit according to 
the PDOWN and PUP signals. The word pointer uses the same configuration as the phase pointer, but the 
word pointer is a 10 bit ring counter. The connection between the two pointers is shown in Figure 15. When 
the phase changes from 2 to 1, the minus signal is asserted to decrease the word pointer. As the SKIP signal 
45 is generated by the coder, it is synchronized to 50 MHz clock. If the SKIP signal is given to the DPLL directly, 
it will be asserted for two cycles. The SKIPsync generator synchronizes the SKIP signal generated by the coder 
to the internal clock of the DPLL. If the SKIPsync signal and the MINUS signal is asserted at the same time, 
SKIPsync signal is ignored to prevent spurious effects. 

There are two reasons for using the reset synchronizer. First, because the states of the pointers are not 
so predictable at power-up, a resetting mechanism is necessary. Second, the reset signal makes the operation 
of the DPLL more reliable by making the RESET signal synchronous to the internal clock of the DPLL. Figure 
16 shows the reset timing diagram. 

The clock multiplexer of Figure 17 is a one stage 30-to-1 multiplexer. It selects one out of 30 multiphase 
clocks. The first stage of the mux is controlled by the phase pointer and the second stage is controlled by the 
55 word pointer. When the phase changes, the internal clock period is reduced temporarily to 29/30 of the system 
clock period. 

The data shuffler aligns the sequentially available data according to the word pointer value. Figure 18 is 
a schematic of the data shuffler. The data shuffler is essentially a barrel shifter. 
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The data selector of Figure 19 passes only 10 bits out of 30 oversampled data according to the value of 
the phase pointer. The selector is an array of ten three-to-one muxes. Since the phase changed data arrives 
at the data selector 1 cycle after the phase pointer changes, three flip-flops are inserted to delay the phase 
pointer values by one cycle. 

Figure 20 is a detailed circuit schematic of the voter 220 shown in block form in Figure 10. The voter es- 
sentially consists of three different sections, each of which receives the oversampled data. Effectively, all of 
the edge information is grouped into three transition positions, each shifted by one clock phase from the pre- 
vious position, and the position with the most transitions is detected by comparing the three numbers that count 
the edges that belong to each group. Thus, the upper portion of the voter 232 includes a series of OR gates, 
each of which is connected to receive two adjoining bits of sample data. For example, OR gate 253 receives 
the sample data from clock ck3 and ck4. The corresponding OR gate 255 in section 235 receives sample data 
at clock phase ck4 and ck5, and another OR gate 257 in section 238 receives sample data at clock ck5 and 
ck6. Thus, if an edge transition occurs between ck4 and ck5, a count of 1 will be produced in section 235 and 
no count will be produced in sections 232 or 238. The OR gates are coupled within counters, such as depicted 
by the circuitry within block 261 , which in turn are compared by the comparators 265. Ultimately, the high count 
appears at one of nodes V0, V1 or V2. This high count can then be used to control the edge detector240 (Figure 
10) which in turn controls the data aligner 210 (Figure 10).- 

In the voter, blocks marked FA are full adders and those marked HA are half adders. There are three out- 
puts from the edge voter. LTran12 is high when the number of transitions between OverData[3n+1 ] and Over- 
Data[3n+2] is larger than the number of transitions between OverData[3n] and OverData[3n+1]. 

Figure 21 illustrates the phase tracker and associated circuitry. Phase tracking is required because drift 
between the transmitter and receiver docks may result in the receiver phases changing slowly over time to 
the point where a bit which should have been interpreted as being in a given cycle is being interpreted as being 
in the preceding or subsequent cycle. This can cause errors in the received data stream. Preferably, the phase 
tracking is coupled to receive the three output signals LTran from the voter circuits shown in Figure 20. Every 
cycle the output signals from the voter can be used by the phase tracker to determine whether an edge moved. 
If an edge moved, the edge' pointer in Figure 10 receives a signal to adjust the location of the pointer. 

The tracking filter receives' the transition number information from edge voter. It compares the transition 
phase with the current phase to change the pointers. Only three successive UPs or DOWNs can change current 
pointer. UP and DOWN signals are generated as in Table 1. E01 . E12, E20 are defined as: 

; E01 = LTRAN 01 • LTRAN 12 
E12 = LTRAN 12 ■ LTRAN 20 
E20 = LTRAN 20 • LTRAN01 

Table 1 below illustrates the logic performed by the phase tracking circuit. 



TABLE 1 


Voter Output 


Current Phase 


Control Signal 


V2V0 V0VT V1V2 


P0 P1 P2 


Up Down 


0 0 0 


X X X 


• 0 0 


1 0, . 0 


1 0 0 


10 


1 0 0 


0 1 0 


0 0 


1 0 0 


0 . 0 ,. 1 


0 1 


0 1 0 


i o ' o 


0' 1 


0 1 0 


0 1 0 


■1 ■ i , 0 


0 1 0 


' o ' 6 1 


6 o 


0 0 1 


1 0 0 


0 0 


0 0 1 


0 1 o 


0 1 


0 0 1 


0 0 1 


1 0 



The first three columns represent combinations of the three input signals V0, V1 and V2 to the phase tracker. 



12 



EP 0 686 920 A2 



The current phase of the system is illustrated in the next three columns. The output signals supplied by the 
phase tracker are shown in the last two columns. As shown, if the current phase and the phase tracking input 
signals are the same, no up or down control signal is supplied. Other input signal conditions, however, can 
generate output signals as shown by the table. 
5 Figure 22 shows the timing of the transmitter. Figure 23 shows the overall timing of the receiver. Both fig- 

ures use the same signal designations as in previous figures. 

Although the foregoing has been a description of the preferred embodiment, this description is intended 
to be illustrative of the invention^ not limiting of, jL The scope of the invention is defined by the appended claims. 

Claims ■ . ' ., 

1 . A circuit for generating a series of high speed clock signals comprising: 
a reference clock signal; - . ■ 
15 a phase frequency detector coupled to receive the reference clock signal and in response thereto 

supply a first control voltage indicative of transitions of the reference clock signal from a first potential to 
a second potential, and a second control voltage indicative of transitions of the reference dock signal from 
the second potential to the first potential; and 

a plurality of-serially connected stages of delay cells, each cell connected to receive the control 
20 voltage from a preceding cell and supply it to a following cell after delaying it, and each cell supplying in 

response to changes in the control voltage a clock signal, a last cell in the plurality also supplying a last 
clock signal to the phase frequency detector, whereby a series of multiple subfrequency clock signals is 
generated which multiple subfrequency clock signals are evenly divided from the reference clock signal. 

25 2. ' A circuit for converting n bits of parallel data to serial data string of n bits comprising: 
a register having n storage locations for temporarily storing the parallel data; 
a source of n clock signals each dock signal having a different phase than every other clock signal; 
a plurality of AND gates, each gate having a first input node coupled to be enabled by one of the 
n dock signals and having a second input node coupled to receive data from one of the n storage locations, 
30 and having an output node; and 

an OR gate having n input nodes, one coupled to each of the output nodes of the n AND gates, 
and having a serial data output node where the parallel data from the n bits supplied in parallel to the 
circuit is supplied as a serial output signal. 

35 3. A circuit for converting n bits of parallel data to serial data string of n bits comprising: 
a source of n different dock signals; < 

a first group of circuit branches, each branch connected in parallel between a first node and a first 
potential source, each branch including three serially connected transistors wherein: 

a first transistor having a control electrode connected to receive one of the n bits of parallel 

40 data, 

a second transistor having a control electrode connected to receive one of the n different 

dock signals, 

a third transistor having a control electrode connected to receive a signal which is an inver- 
sion of a selected one of the n different dock signals, the selected one being the dock signal next in phase 
45 following the clock signal supplied to the second transistor; 

a second group of circuit branches, each branch connected in parallel between a second node and 
the first potential source, each branch induding three serially connected transistors wherein: 

a first transistor having a control electrode connected to receive a signal which corresponds 
to an inversion of one of the n bits of parallel data, 
so a second transistor having a control electrode connected to receive one of the n different 

dock signals, 

a third transistor having a control electrode connected to receive a signal which is an inver- 
sion of a selected one of the n different dock signals, the selected one being the dock signal next in phase 
following the clock signal supplied to the second transistor and 
55 an enabling circuit connected to the first and the second node for supplying signals therefrom to 

a conductor, whereby parallel data is converted to a serial data string and supplied on the conductor. 

4. A bidirectional bridge circuit for extracting incoming data from a data stream on a transmission line, the 
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data stream having a mixture of incoming and outgoing data, the circuit comprising: 

a first signal generator for replicating the outgoing data signals from a transmitter; and 
a comparison circuit having a first input node coupled to the first signal generator and a second 
input node coupled to the transmission line, the comparison circuit for subtracting the outgoing data sig- 
nals from the first signal generatorfrom the mixture of signals on the transmission line to thereby provide 
the incoming data stream. 

A bidirectional bridge circuit for extracting incoming data from a data stream on a transmission line, the 
data stream having a mixture of incoming and outgoing data, the circuit comprising: 

a first transistor having a control electrode coupled to receive a replica of the outgoing data trans- 
missions, having a first electrode connected to a current source and having a second electrode connected 
through a load to a potential source; 

a second transistor having a control electrode coupled to receive the data stream on the transmis- 
sion line, having a first electrode connected to the current source and having a second electrode con- 
nected to a potential source; and 

an output node connected to the second electrode of the first transistor for supplying the incoming 

data. 

A differential bidirectional bridge circuit for extracting incoming data from a data stream on a differential 
transmission line having first and second electrodes, the data stream having a mixture of incoming and 
outgoing data, the circuit comprising: 

a first circuit coupled to a first electrode of the differential transmission line, the first circuit includ- 
ing: 

a first transistor having a control electrode coupled to receive a replica of the outgoing data 
transmissions, having a first electrode connected to a current source and having a second electrode con- 
nected through a load to a potential source, and to a true output node for supplying true incoming data 
from the transmission line at the true output node 

a second transistor having a control electrode coupled to the first electrode of the trans- 
mission line to receive the data stream on the transmission line, having a first electrode connected to the 
current source and having a second electrode connected through a load to a potential source and to a 
complementary output node for supplying complementary incoming data from the transmission line at the 
complementary output node; 

a second circuit coupled to a second electrode of the differential transmission line, the second cir- 
cuit including: 

a third transistor having a control electrode coupled to receive a complementary replica of 
the outgoing data transmissions, having a first electrode connected to the current source and having a 
second electrode connected to the complementary output node; 

a fourth transistor having a control electrode coupled to the second electrode of the trans- 
mission line to receive the data stream on the transmission line, having a first electrode connected to the 
current source and having a second electrode connected to the true output node. 

A receiver for converting a serial data input stream into a parallel data output stream comprising: 
a plurality of registers for receiving the serial data input stream; 

a clock generator circuit for supplying a plurality of clock signals to the plurality of registers; 

a sampler for detecting the state of the data in the registers in response to the clock signals, and 
in response, supplying output signals indicative of the data; 

a data aligner connected to receive the output signals from the parallel sampler and in response 
thereto partition the data into words; 

a voter connected to the data aligner to detect zero crossings from the data samples to find a pos- 
ition with the most edge transitions; and 

a data separator for converting the serial data from the voter into parallel data. 



14 



EP 0 686 920 A2 



12- 



42 < 



25- 



PARALLEL 
TO SERIAL 
CONVERTER 



15. .20 ... 



40 

\ 



DATA 
SEPARATOR 



18 



/ 



DRIVER 



34 



2-WAY 
BUFFER 



30- 



OVER- 
SAMPLER & 
SERIAL TO 
PARALLEL 
CONVERTER 



TRANSMISSION 
LINE 





DRIVER 



32 



/ 
100 



10 



FIG. 1 



On-1- 
Qn; 
Qo- 
Q4- 
Q 8 - 

Qn-A- 
Qn- 

Qo- 

Q4- 



T- 



\ DUMMY 
/ LINES 



t CLOCK 
LINES 



\ DUMMY 
J LINES 



FIG. 2A 



15 



EP 0 686 920 A2 




i 

LU 
Z 
Z 
< 
X 

o 



CM 

I 

LU 

z 
z 

< 
I 
o 



LU 

z 
z 
< 
I 
o 



LU 

z 
z 
< 

X 

o 



16 



EP 0 686 920 A2 



70. 
REF / 



64 



65 
/' 



M_clk(2) 



PHASE 
FREQUENCY 
DETECTOR 



UP 



DOWN 



CHARGE PUMP 
& LOOP FILTER 




74 



CONTROL 
VOLTAGE 



FIG. 3 



17 



EP 0 686 920 A2 




18 



EP 0 686 920 A2 



J 



CD 



JF 



Li 



19 



EP0 686 920 A2 




20 



EP 0 686 920 A2. 



Si. 



LU 

o 

X 
3 



111 



-C 

o 

I 

w 

I 

x 
E 



Scvi "J 
rn " LJ 

^,0 > 

x n q 

o 



c 
ro 

03 
T3 

I 

x 
H 



3 

a 

LU 
Lt 
LL 



CO 
H. 



Q 



CO 

Q 



to 

a 



IT) 

a 



CO 

Q ' 



a 



o 
Q 



LU 05 

o 



UJ CO 

o 



LU (s - 

o 



UJ(D 

o 



LU ^ 

o 



LU 
O 



LU rt 
O 



UJC\J 

o 



LU 
O 



LU O 

O 

i 

'JZ 

o 

'I' — 

if) 
x 

3 

E 



tx 

UJ 

CL 
3 



CO 
H 

m 

o . 

QC 
LU 

5 
o 



.5 

CD 
LL 



oj 
+ 



CO 



21 



EP 0 686 920 A2 



o 



to 
n 



q 
6i 



o 
cn 

iif 

CD 
"O 

I 

X 



QQ 
CD 



X X 



r o 



o 
o 

_l 

o 



' 111 

t- 
co 
>. 
to 



to 
t- 

1- 

3 

o 

x 

■12 5 
■5 [2 

o 

CM 



CO 

1- 

D 
Q. 

X 
3 



CO 

o 



5 

I 

X 

E 



22 



EP 0 686 920 A2 




23, 



EP 0 686 920 A2 



LOAD 1 



140 . 

/' ' MIXED SIGNAL 



TRANSMISSION LINE 



144 

INVERTED i 
SENDING DATA V 



Wi 




LOAD 2 



REPLICA 
SENDING 
SIGNAL 



150 



i 



\ 

100 



145 



3 1 v Ri 



EF 



147- 



BIAS1 



NOT RECEIVED SIGNAL 



152 i 142 

155-^ ,B!AS2 



157 



FIG. 8 



24 



EP 0 686 920 A2 




25, 



EP 0 686 920 A2 




CD 



26 



EP 0 686 920 A2 



10 0 1 



I t t I I I I 

CKO CK3 CK6 



t t I i t 

CK10 CK18 CK21 

EDGE POINTER 

FIG. 11 



i_ 



CK29 



* 2 h[ 

<t>i -it 
215 



218- 



217- 



212 
213 



Y 



211 
214 



<t>2H 



si 



216 



T T_ T T 



<1> 2 H[ 



V 



FIRST STAGE 
t 



i 



i SECOND STAGE 



V 



FIG. 12 



27 



EP 0 686 920 A2 



TRANSMITTER'S 
CLOCK 



TRANSMITTER FASTER 



CK(n-1) 
CK(n) 
CK(n+1) 

REC CLK 



Y 



Y 



Y 



"N ; — v 



r 



FIG. 13A 



TRANSMITTER'S 
CLOCK 



CK(n-1) 
CK(n) 
CK(n+1) 

REC CLK 



RECEIVER FASTER 



\ r — v 



Y 



Y__7 









- 



f 



\ 1 




\ / 


r~ ■ ■ 


I / 


y 


1 



\ 7 



FIG. 13B 



28 



EP 0 686 920 A2 




29 



EP 0 686 920 A2 



i 1 

O 



I 

o 



o 



a 




a 





o 
cc 

LU 
LU 

a 

o 
c 
>» 

U5 

a. 



OJ 

o 
a. 




5 
O 
Q 
a. 



CD 



<r 

LU 
N 

2 

o 
cc 

X 

o 
z 
>- 
to 

H 
LU 
CO 
LU 

tr 



30 



EP 0 686 920 A2 



RESET 
INT_CLK 
RESETsync 



V 



^7 \ 



r 



FIG. 16 



CKO 
CK1 
CK2 

CK3 
CK4 
CK5 

CK6 
CK7 
CK8 



3 TO 1 MUX 



CK27 
CK28 
CK29 



P0 

P1 





I 

A 



A 



P2 



IF 



A 



A 



W9 WO W1 

FIG. 17 



W8 



INT CLK 




A 



31.-. 



EP 0 686 920 A2 




32' 



EP 0 686 920 A2 



/ 



y. 



/. 

i t 

PD2 PDO PD1 



PO- 



PDO 



PI 



P2- 



PD1 



PD2 



INT CLK 



OverDataO 
OverDatai 
OverData2 

OverData3 ■ 
OverData4 - 
OverDataS ■ 



■ RecDataO 



RecDatal 



— RecData2 



OverData29 



— RecData9 



FIG. 19 



33. 



EP 0 686 920 A2 



fOverPataO Q Q 

Over Data3 
Qver Pata4 J ] 
Qver Pata6 



LTran12 




n20 9 



COMPARE 



FIG. 20 



COUNT 



220 



34' 



EP 0 686 920 A2 



LTran12 



D Q 



- Q 



L 



LTran20 



D Q 
Q 



LTranOI 



D Q 
Q 

^ i 



0 D D p ° p1 p 2 



DOWN 




PDOWN 










D Q 


) — 



o 



PUP 



FIG. 21 



35 



EP 0 686 920 A2 




36 



EP 0 686 920 A2 




37 



THIS PAGE BLANK 



WSPTOj 



